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C^ 1 Abstract 

-t— > \ 

In this paper we estimate the parameters of the qubit Pauli channel using the 

channel matrix formalism. The main novelty of this work is that we do not assume 

q-i the directions of the Pauli channel to be known, but they are determined through 

the tomography process, too. The results show that for optimally estimating the 

contraction parameters and the channel matrix we should have input qubits and 

measurements in the channel directions. However, for optimally estimating the 

^J , channel directions, we should use different tomography conditions. 

Keywords: experiment design, Pauli channel, parameter estimation, measurement, 

■"nt" ! quadratic error, qubit. 

O 

CO i 

1 Introduction 

The accurate description of different quantum phenomena is a key issue in their potential 
use in modern IT-applications. In quantum mechanics, both dynamical changes and 
communication is treated using quantum channels. Therefore the parameter estimation 
of quantum channels plays a major role in quantum information processing, and the area 
of quantum process tomography is nourishing [U HJ El H21 IT8] . 

Direct quantum process tomography is performed by sending known quantum systems 
into the channel, and then estimating the output state. In quantum mechanics the mea- 
surement has a probabilistic nature [Til [T2] . therefore many identical copies of the input 
quantum system are needed, and an estimator is constructed by using statistical consid- 
erations. For achieving efficient process tomography, experiment design is necessary that 
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consists of selecting the optimal input state, optimal measurement of the output state, 
and an efficient estimator of the channel from the measured data. 

The field of quantum process tomography is well-established, an exhaustive description 
of possible tomography methods can be found in [9J . The Pauli channels form a relatively 
wide family of quantum channels. The tomography of Pauli channels has a huge literature, 
however, due to the level of difficulty of the topic, papers mostly deal with special cases, 
e.g., with the optimal parameter estimation of a depolarizing channel [IT]. But there are 
some publications investigating the estimation of multi-parameter channels jU [TS], and 
the multidimensional case also appears [TU]. There are also some experimental results 
concerning the optimal estimation of the Pauli-channels [5j [6]. 

In contrast to the majority of the works in this area, we propose an extended problem 
statement: we investigate qubit Pauli channels with unknown channel directions. Despite 
of the novelty of the approach, there are a few papers that deal with optimally estimating 
qubit Pauli channels including their channel directions. In [2j the problem was examined 
using convex optimization methods, and a numerical method was provided for finding the 
optimal input - measurement pairs. In [16J we examined the optimality of the estimation 
problem using purely statistical considerations to achieve analytical results. However, 
analytical results could only be obtained for the case of known channel directions. 

Therefore, the aim of this paper is to give an analytical description of the optimal 
estimation of Pauli channels in the case of unknown channel directions, too. The efficiency 
of these estimations is measured here with three quantities: the mean squared error of 
the estimated contraction parameters and angle parameters, and the mean distance of 
the estimated and the real channel matrix are investigated. 

The paper is organized as follows. In Section 2 we introduce the necessary notions to 
understand the rest of the article. In Section 3 the tomography method used is described. 
In Section 4 the optimization of the previously mentioned quantities are performed. Fi- 
nally, conclusions are drawn. 

2 Preliminaries 

In the following section, we give a short introduction to the applied concepts. A more 
detailed description can be found in [TT| li~2] . We will only examine two- level systems, 
that is quantum bits or qubits. 

The state of two-level quantum systems is described by density matrices p G M2(C) 
that are parametrized by a real vector 9 e R 3 called the Block vector. 

Definition 1 (Bloch parametrization). 

p : R 3 -)■ M 2 (C); 9 = (9 1 , 6 2 , 6 3 ) T H- p{6) = i (/ + 9 1 a 1 + 9 2 a 2 + Ms, ) (2.1) 



where 



1 \ / 1 \ / -i \ / 1 



a ° =- l o i ' ai = 1 ' ° 2 = { i o ' a3 = V -1 



Definition 2 (Quantum state). A qubit can be described with 2x2 density matrices (p{9)) 
satisfying the following condition: 

Tr(p(0)) = 1 
pifi) > 0. 



It is easy to check that p(9) is a density matrix if and only if 9\ + 9\ + 9\ < 1. That 
is, the state space can be represented with the unit ball in R 3 , the so-called Bloch ball. 

We will use von Neumann measurements with two possible outcomes |14| . 

Definition 3 (Measurement). {P, I — P} is a von Neumann measurement if P is 2 x 2 
projection. The probability of measuring outcome P on the system p{9) is Tr(p(8)P). 

The only non-trivial case is when P is a rank-one projection. Then using the same 
Bloch parametrization as previously, we can rewrite P in the following form: 

P = T (°b + m i a i + m 20~2 + fn 3 a 3 ) , (2.2) 

where ml + m\ + m\ = 1. Using the abbreviation m = (m 1 ,m 2 , m 3 ) T and 9 = {9i,9 2 , 9 3 ) T 
we obtain 

Prob(" measuring P") = Tr(p(9)P) = -(l + m-9). 

Quantum channels are completely positive, trace-preserving maps, and Pauli channels 
are a well-known family of them in the qubit case. 

Definition 4 (Pauli channel). Let be {vq = I,Vi,V2,Vs} an arbitrary base satisfying 
Tr(viVj) = 25ij ,Vi G M|' a '(C) Vi,j G {0, 1,2,3}. Let be Ai, A2, A3 real numbers that fulfill 

1±A 3 > |Ai±A 2 |. (2.3) 

Then a Pauli channel can be described with a mapping 



£ : M 2 (C) ^ M 2 (C); p = \ ( I + J>v< J H- £{p) = \ [i + ^XiOivA . (2.4) 




The affine subspaces {| (J + tvj) : £ G IR} C M2' a (C) (i G {1, 2, 3}j are called the channel 
directions, the numbers Ai, A 2 , A3 are called the contraction parameters. 

The completely positiveness of the mapping is guaranteed by condition (12. 3p (see 
p]). From (E3D follows that -1 < Aj < 1, i G {1,2,3}. If we look at the effect of a 
Pauli channel on the Bloch vectors then it becomes clear that we have contractions with 
parameter \ in the appropriate channel directions. In other words, the image of the 
whole Bloch ball will be an ellipsoid with its axes lying in the directions of the channel 
and being 2|A»| long. 

Definition 5 (Channel matrix). We will call the mapping A : R 3 — y R 3 the channel 
matrix of Pauli channel £, if 

£op = poA. (2.5) 

Straightforward calculations show that A is a linear mapping and it can be always 
parametrized in the following form: 





Theorem 1. The channel matrix A of every Pauli channel can be constructed as 

A(Ai, A 2 , A 3 , (f> Zi K <t>*) = RzRyR.AR^Ry'K 1 , (2.6) 

where 

R z (d> z ) = 



R, 



Note that this parametrization is only surjective, not bijective. This means that we 
can get the same channel matrix for several set of parameters. 

3 Tomography of Pauli channels 

Using the notions described in the previous section, we can characterize the Pauli channel 
with its channel matrix (A). Furthermore, Theorem [1] shows that the channel matrix can 
be constructed by using three contraction parameters (A,) and three angle parameters 
((pi). Our aim is to give the best estimation of these quantities. 

For this purpose we will send some input qubits through the channel, perform some 
measurements and construct an estimator from the measured data. 

Input qubits and measurements We have to have at least three different input 
qubits for complete channel tomography. Let us suppose that we have three different 
measurements, too. Previous investigations showed that one should choose pure input 
states, which are orthogonal to each other [2j,[16j. 

Let us suppose that the Bloch vectors of the input states are 9} 1 ' , 6} 2 ' , 6} 3 ' . If they are 
orthogonal, they create an orthogonal matrix and we can parametrize them the following 
way 



e 



0(1)^(2)^(3) = R z ($ z )R y (# y )R x (# x ) : (3.1) 



where R z ,R y , R x are the same rotations as in (12. 6p and < $ z , fly < 7T, < i? x < |. 

Similarly, we suppose that the measurements with Bloch vectors m^ , rrS 2 \ rnP^ are 
orthogonal |16| . Thus, we can parametrize them using one-dimensional rotations, too: 

M = [m (1) ,m (2) ,m (3) ] = R z (r z )R y (r y )R x (T x ) (3.2) 

where < r z , r y < %, < r x < |. 

Estimation of the channel matrix The matrix generated from the Bloch vectors of 



the output qubits S 



t(l) t(2) t(3) 



fulfill 



E = A(\x, X 2 , A 3 , (f) z , (f) y , (f) x )®- (3.3) 



Let us suppose that we perform the i-th measurement A?" times on the copies of the 
j-th output qubit and denote by Nij the number of measurement results corresponding 
to m?'. Then N^ is binomially distributed with the following parameters 

N i3 ~ Binom N, ~ 2 = J . (3.4) 

We can use the notation Xij = m^ ■ £ , V i,j G {1,2,3}, then X = {xij}fj =1 can be 
written in the form 

X = M T E. (3.5) 

From (j3.4p we can estimate the elements of X (since E(A^j) = iV • — |^-): 

2 
Xij ■= — ■ Nij-1. (3.6) 

Furthermore, the elements of S can also be estimeted from (13. 5p as 

S := MX. (3.7) 

Finally, from (13. 3p we can obtain an estimation of the channel matrix 

A := S0" 1 = MXQ T . (3.8) 

Estimation of the channel parameters From (12. 6p we know the mapping from the 
sets of parameters to the channel matrices: 

A : V -¥ M 3 (R); (A x , A 2 , A 3 , <f) z , <p y , <f) x ) H- A(X 1 , A 2 , A 3 , <fi z , <f> y , <t> x ), 

where T> C 1R 6 , and our aim is to define its inverse mapping, i.e., we want to estimate the 
channel parameters from the estimation of the channel matrix (13. 8p . That is we need to 
find the mapping 

T : M 3 (K) -)• M 6 ; A i-> (Ai, A 2 , A 3 , Z , j> y , £,), 

which fulfills 

ToA = ld v . (3.9) 

We can construct T the following way. Let us symmetrize our estimation: 

A s := l -(A + A T 

Then A s can be diagonalized in an orthonormal basis. The Jordan decomposition of A s 
can be calculated easily algebraically since its characteristic polynomial is cubic. This 
way we get the eigenvalues Ai > A 2 > A 3 , which are exactly the estimates of the channel 
contraction parameters, and the corresponding normalized eigenvectors: Vi,v 2 ,v 3 are 
also obtained. 

The next question is how we can calculate the angle parameters from these eigenvec- 
tors. From (12. 6p we can see the geometrical meaning of the angle parameters: <p z and (p y 
are the polar and azimuth angles of vi, while cj) x is the angle of v 2 and the intersection 
of planes z = and Vi (orthogonal subspace of vi). 

These quantities can be uniquely determined only if Ai > A 2 > A 3 , so we need to 
restrict the domain of mapping A (V) to get a one-to-one correspondence between channel 
matrices and channel parameters. 



Theorem 2. The parametrization will be bijective on the following domain: 

V = {(\ u \ 2 ,\ 3 , ( f )z , ( J) y , ( f )x )em 6 : 1±A 3 > |Ai±A 2 |, Ai> A 2 > A 3 , 

7T 
(f>z, ^y: 4>x G [0, 7T), 4> y = - => (f) z = 0, Ai = A 2 = A 3 => <p z = <f> y = (j) x = 0, 

Ai = A 2 > A 3 => (<f> x = and <p y = =>• </> 2 = 0), 

Ai > A 2 = A 3 =>- 0a: = 0}. 

4 Optimal input states and measurements 

Let us introduce some abbreviations: 

A = [Ai, A 2 , A 3 ] — channel contraction parameters, 
4> — [ ( t ) z: ( t ) y^4 > x\ ~ channel angle parameters, 
t = [r z , T y , t x ] — measurement parameters, 
•d_ = [& z , $ y , fix] ~ input qubits parameters. 

In the previous section we gave an estimation method of the channel matrix (A), con- 
traction parameters (A) and angle parameters (0), now we want to analyze the estimates. 

The distance between the real and estimated parameters can be given in various ways 
[3]. Let us define our loss functions as the averaged squared errors of the previously 
mentioned quantities: 

/(A,0,r,^,^) = E(p(A 1 ,A 2 ,A 3 ,0,,^,^)-A(A 1 ,A 2 ,A3,0,,0j / ,0 :c )|| 2 ), (4.1) 

g&hz&N) = E ((Ax - Ax) 2 + (A 2 - A 2 ) 2 + (A 3 - A 3 ) 2 ) , (4.2) 

h(\,$,Z,ti,N) = E(dist(^,^) 2 + dist(0j / ,^) 2 + dist(4,^) 2 ) , (4.3) 

where || • || is the Hilbert-Schmidt norm, dist(0, (f) '.— inf{|0 — ((f) + kir)\ : k G Z}. 

Remark. The estimation error of the channel matrix is equivalent to the average estima- 
tion error of output qubits, since 

3/3 \ 2 

[\\£( P (6))-£(p(6))\\ 2 de= J \\\Ae-Ae\\ 2 dd = l - f "£ lj2(A ij - A^eA de = 

33 3 

= \ [ E E ^(4* - 4) 2 ^ + 1 I E E MMA*i - 4)(A* - A lk )de = 

{ *=1 3=1 { t=l j<k 

= \ E E f / e ' de ) (^ - 4) 2 + E E f / w» ) (^ - 4-XA* - A fe ) = 

4=1 i =1 \e / i=1 i< k Ye / 

= c\\A-A\\ 2 , 

where in the last step we used the symmetry of Bloch-ball: J e 6 2 d6 =constant, J e 6j9kd9 = 
0. 

In the following we will find the optimal inputs and measurements for a given channel 
(A, <p) and number of measurements (N) that minimize the above loss functions. 
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4.1 The optimal estimation of the channel matrix 

Before obtaining the optimal estimation settings, we will introduce some useful state- 
ments. 

Proposition 3 (Rotational invariance). For any O G M 3 (R) orthogonal matrix, the 
estimation of the Pauli channel described by the channel matrix A using the input and 
measurement settings and M (see Ii3. 1\) , hS.ty ) is exactly as efficient as the estimation 
of the Pauli channel described by OAO" 1 with the input and measurement settings O0 
and OM. 

Therefore, it is enough to investigate Pauli channels with channel parameters <p z = 

Lemma 4 (Unbiasedness 1). The estimation of the channel matrix given in Ii3.8\) is 
unbiased. 

Proof. The estimation of the elements of X defined in (13.61) is unbiased (that is, EX = X). 
The estimator of (13. 8 p is a linear function of X, the expected value is linear, too, so 

EA = A. □ 

Let us introduce the notation a^ = [A]^ (i,j G {1,2,3}) and the index set H = 
{11, 12, ... , 32, 33}. A is a linear transformation of X, so 

where the constants come from the actual values of and M, hence determined by the 
parameters ;# and r. 

Lemma 5. Set ip = ^^eH^kO-k {dk G K). Then 

Var(V0 = £(5>c w (r,tf)) ^- (4.5) 



l£H \k£H 



Proof. From (13. 4p and (13. 6p it is easy to calculate the distribution of the elements of X. 
Using the well known properties of the binomial distribution and the independence of the 
different elements of X, straightforward calculations verify the statement. □ 



Proposition 6. The matrix {cki}k,ieH G M 9 (R) defined by equation ft4-4\) ^ s orthogonal, 



because and M are orthogonal matrices. From this follows that the Hadamard- square of 
this matrix, {c 2 kl \k,i£H is bistochastic. That is J2k&H c ti = 1 V/ G i/, J2ieH c li = 1 VA; G i?. 

Theorem 7. 

/(A, 0, t, & N) > 1 (6 - (A? + X\ + \D) , (4.6) 



an 



d Iji4.6\ ) holds with equality, if r = $ = 0. 



Proof. The distance of the channel matrix and its estimation: 

\\A(\i,\ 2 ,\3,<l>z,(j>v,<l>x) -A(\x,\ 2 ,\3,(i>z,<f>y,<f>x) l| 2 = ||^(Al,A 2 , A3, (f) z , (f) y , <j) x ) - A s \\ 2 

/- \2 , /- \2 , /- \2 , o / fl 12 ~~ a 12 . a 21 — a 21 \ 

= (an - an) + (a 22 - a 22 ) + (a 33 - a 33 ) + 2 I 1 I + 

v 2 /* * \ 2 

.-, | Ql3 ~ a 13 Q31 ~ Q 31 \ r, / a 23 ~ ^23 «32 ~ ^32 



2 2 / V 2 2 

From Lemma H] the mean squared error of the a^-s can be written as their variance, hence 

/(A,O,r,^,^)=E(||A(A 1 ,A 2 ,A 3 ,0 2 ,^,0,)-^(Ai,A 2 ,A 3 ,^,0,,4)l| 2 ) = 
= - ( Var(a i2 + a 2 i) + Var(ai 3 + a 31 ) + Var(a 23 + a 32 ) J + ^ Var(aii) = 

^ ' ie{l,2,3} 

= -- ( Var(ai 2 - a 21 ) + Var(&i 3 - a 31 ) + Var(a 23 - a 32 ) J + J^ Var(ay). 
By Lemma [5] 



Var(a fc ) = ^4(r,^)i^, 
leH 

hence using the bistochastic property of the matrix {c 2 kl \k,ieH described in Proposition [61 
we can see that 

E v« ( fc) = e E 4(i, fi)^ = E ^ 4 f 9 - £ *■ ) ■ (4J) 

feGif fcGH zetf " leH \ leH J 



On the other hand, Lemma shows that 



1 - x 2 
Var (a m - a„) = 2j(cmj - c n ;) 2 — — -i. (4i 



ZGff 



From the orthogonality of {cki}k,ieH follows that Y1i^h( c "iI~ c ni) 2 — 2, since this expression 

N — N 



is the norm-square of orthogonal unit vectors in IR 9 . Therefore — ^- < -^ implies that 



Var(a m - a n ) < ^. 
Finally 

^xj = TrXX T = Ty{M- 1 AQ){Q- 1 A t M) = Ti AA T = A 2 + \\ + A 2 . (4.9) 
leH 

hence 

/(A,0,r,^,iV)>-I(l + l + l) + l(9-(A? + A 2 + A 2 )). 

which is equivalent to the inequality (14. 6 p . It is easy to check that in the r = $ — case 

Var(o ii ) = -^(1 -^A 2 ), z,j e {1,2,3}, 

2 
Var(a i2 - a 2 i) = Var(ai 3 - a 3 i) = Var(a 23 - a 32 ) = — , 

hence the minimum of /(A, 0, r, #, N) is taken in r = $ = 0. D 



4.2 The optimal estimation of the contraction parameters 

It is difficult to compute the loss functions g and h defined in (14.21) and (14. 3p . because 
the Aj (i G {1,2,3}) and the (j) a (a G {z,y,x}) parameter estimators are non-linear. 
However, one can approximate these parameter estimators with the their first-order Taylor 
polynomial (recall that the Aj, (fi a estimators are M 3 (M) — > K. functions). Lemma H] shows 
that E (Jiij) = dij, while from Lemma [5] it follows that Var (ay) = 0(-^)- Hence the 

Taylor polynomial of functions Aj, <p a with the base point E(A) = A will be an appropriate 
approximation, if N is large enough. 

So the linearized estimators can be written in the form 



A, := Xi(A) + ^gradA^A), A - A) (V i G {1, 2, 3}), (4.10) 

a := a (A) + (grad0 a (A), A - A) (V a e {z,y,x}). (4.11) 



Lemma 8 (Unbiasedness 2). The above defined Aj (i G {1,2,3}), ^> a (a G {z,2/,x}) 
estimations of the channel parameters are unbiased. 

Proof. It follows from (13. 9p that we have A» (A(X, </>)) = Aj and <\> a (A(A, 0)) = <\> a . Using 
the unbiasedness of A we get 

E(\i) = E{\i(A)) + E ((gradA^A), A - a)) = A<(A) + (grad\(A), E(i - A)) = 

= Ai(A) + = A,, 
and similarly 

E(0 Q ) = E(0«(A)) + E ((grad<^(A), A - a}) = <f> a . 

D 

According to Proposition [3j it is enough to consider the case when the Pauli channel 
described by A has angle parameters (p z = (fi y = (fi x = 0. 
Let us define the mapping T with formula 

T : M 3 (K) ->• K 6 ; A H> (Ai, A 2 , A 3 , 2 , ^, 0.) 

Lemma 9. The non-vanishing components ofdT(A(X,0)) are 

dX 1 dX 2 <9A 3 d(p z 1 dcj>y 1 d(f) x 1 



da u da 22 da 33 ' da 12tS X 1 - A 2 ' da 13tS A1-A3' <9a 23iS A 2 -A 3 ' 

w/iere a ij)S = ~ (ay + a^) . 

Proof. The parameter estimation T is the left-inverse of the channel parametrization 

A : V ->■ M 3 (R); (Ai, A 2 , A 3 , ^, y , X ) h-> A(Ai, A 2 , A 3 , z , y , X ), 

hence 

dT(A(A,0)) = (dA(A,0)) _1 . (4.12) 
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We can calculate the elements of oM(A, (/)) easily. For example: 

(Ai cos 2 <p z + A 2 sin 2 <p z (Ai — A 2 ) sm<p z cos <fi z 
(Ai - A 2 ) sin (f) z cos <j> z Ai sin 2 4> z + A 2 cos 2 (f> z 
1 

so 

qa ( (Ai-Aa) 

— -(Ai, A 2 ,A 3 , 0,0,0)= (Ai-A 2 ) 

d ^ \ 

From similar calculations we get that 

dA(Ai, A 2 , A 3 , 0, 0, 0) = Diag(l, 1, 1, X 1 - A 2 , Ai - A 3 , A 2 - A 3 ). 
from which the statement follows. □ 

Let us substitute the result of Lemma O into the definition of Aj (see (|4.10jl ). and use 
the unbiasedness of Aj, and A. Then the loss function defined by the linearized parameter 
estimators will have the following form: 

S(A, 0, r, 0, N) : = E ((A x - A x ) 2 + (A 2 - A 2 ) 2 + (A 3 - A 3 ) 2 ) = 

= Var(on) + Var(a 22 ) + Var(a 33 ). (4.13) 

This function is asymptotically equal to g, but easier to handle, thus, we can perform 
the analytical optimization. 

Theorem 10. 

S(A, Q, r,& iV) > 1 (3 - (A 2 + A 2 + A 2 )) , (4.14) 

and Iji4-14\ ) holds with equality, if r = §_ = 0. 
Proof. 

g(£,0,z,& t N)= J2 Var ( a -)= J2 VarCa^-^Var^). (4.15) 

ie{l,2,3} i,ie{l,2,3} &j 

(14. 5 p shows that Var(a^) < -L (Vi,j), since ^ < -^ and the {c 2 ^}^^ matrix is 
bistochastic. Therefore 

JJVar&j) < — . 

Now, by the results of (14.71) and (14. 9 p we can write the following inequality based on the 
equation (14. 15ft : 

Var(ou) + Var(a 22 ) + Var(a 33 ) > -^ (9 - (A 2 + A 2 + A 2 )) - ^, 

and this is the statement of the theorem. It is easy to see that in the r = $ = case 
Var(on) = £(1-A?), Var(a 22 ) = ^(1-A 2 ), Var(a 33 ) = ^(1-A 2 ), hence g(\,0,r,ti, N) 
is minimal in r = $ = 0. D 
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4.3 The optimal estimation of the angle parameters 

From Lemma M we can determine the loss function that measures the accuracy of the 
estimation of the angle parameters: 

h(X,0,T,±,N) = E ((& - cf> z ) 2 + (0, - 4> y f + {4> x - </g 2 ) = 

= 777 TW Var ( ai2 + a21 ) + 777 rT^ Var ( a i3 + 631) + 777 -^Var(a 2 3 + a 32 ). 

4(Ai - \ 2 y 4(Ai - A 3 ) 2 4(A 2 - A3) 2 

Using the result of Lemma El Var(ai 2 + S21), Var(ai3 + S31) and Var(a 2 3 + a3 2 ) can be 
expressed as a function of r,$ and X = {a;/}^^. The definition of matrices and M 
and equations ( 13. 3p and (13. 5p show that X can be written as a function of r, 1? and A 
(recall that we fixed = 0). Hence, h(\,0,T,$,N) can be written in a quite extensive, 
but explicit, closed form. 

For fixed A and N, the optimization of h(X,0,r, #, iV) is a minimization problem with 
six variables (T z ,T y ,T x ,'d z ,'&y,'& x ), but unfortunately this optimization problem can not 
be solved analytically. 

However, we can formulate conjectures based on numerical optimization computations. 

Conjecture 11. For any fixed parameters A, N 

• »/ ft(A> Q, £, g, N ) is minimal at (r^,^), then r opt = g^, 

• the estimation strategies described by parameters r_ x — §_i = (f , f , 0) and r 2 = $ 2 = 
(j,0,^) are nearly optimal. 

The following examples taken from our numerical optimizations studies indicate the 
validity of Conjecture [TT1 

Example 12. Let us fix the following parameters: Ai = 0.8, A 2 = 0.65, A3 = 0.5 and 
N = 1000. Then the optimal {r_ ^ t , r d_ opi ) (we can calculate them numerically) does not 
show any regularity except r opt = '& opt , with 

wJnh = h(r <vt ,0 opt ) = 0.03634. 

We can calculate the values at the two points given in Conjecture[Jl\ h(r 1 , ^_ x ) = /i(V 2 ,:# 2 ) = 
0.03676. So the difference can be considered small compared to h(0, 0) = 0.05. 

Example 13. The situation is similar for the parameters Ai = 0.9, A 2 = 0.67, A3 = 0.6 
and N = 1000. For the optimal input and measurement directions we have r opt = l d_ opt 
from numerical optimization, with 

wmh = h(r cpt) opt ) = 0.01659. 

T, 17 

In comparison, h^Zi^i) = h{r_ 2l 'd_ 2 ) = 0.01675 and h(0,0) = 0.02446. 
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Pauli channel with known parameters in one direction Let us assume that we 
have some information about the Pauli channel: 



v 3 = a 3 and A 3 = 0. 
In this case the channel matrix has the following simplified form 

A(\ 1 ,\ 2 ,<p) = R(<p)A(\ 1 ,\ 2 )R( ( f ) r 1 . 



(4.16) 



(4.17) 



Now its is easy to show that the input states and the von Neumann measurements should 
be orthogonal vectors in the plane spanned by a\ and a 2 . Hence, the input states and the 
measurements can be parametrized using a single angle parameter d and r, respectively: 



6 



9u 9-21 

#12 #22 

1 



R(#), M 



fflll m 2 l 

777i2 77722 

1 



R( 



T . 



Then we have to solve the following minimization problem with two variables (r, $) 
for fixed X 1 ,X 2 ,N values: 



h 2 (\ 1 ,\ 2 ,0,T,$,N) = E 



(4.18) 



It can be solved analytically, but the proof is rather technical than difficult. The result is 
formulated in the following theorem: 

Theorem 14. Assume that Ai > A2 and Ai 7^ — A2. 

1. If (Ai + A 2 ) 2 > 2(Ai — A2) 2 , then h 2 (Xi, X 2 , 0, r, ■#, N) is minimal if and only if 

7P 



'opt "opt 



n 
4 



mod 



The minimal value of the loss function is 



^ (Al ' A2 '°'I'I' Af) = 4(A 1 -A 2 )^ 



(4-(A 1 + A 2 ) 2 ). 



2. If (Ai + A2) 2 < 2(Ai — A2) 2 , then h 2 (Xi, X 2 , 0, r, ■$, N) is minimal if and only if 



T op t = "& pt = x or --x (mod - , 



where x = \ arccos ( ~ 2 \\ + _\ \2 ) ■ The minimal value 



is now 



MA 1 ,A 2 ,0,^,^,i V )^ i ^l^ 5 L(4-(A; + AS-igi±g 



Example 15. // Ai = 0.8 and X 2 = 0.2 the optimal input and measurement directions 
are Topt = $o P t = f ■ These are the optimal angles in most cases, however, if X± — 1 and 
X 2 = then r opt = 79 opt = | or |. 
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5 Conclusion and discussion 

In Theorem [T] we gave a parametrization of the channel matrix A, while in Theorem [2] 
we obtained the domain where the parametrization is bijective. We gave an estimation 
procedure for the channel matrix and channel parameters in Section 3, and then optimized 
this process with respect to input qubit and measurement directions. We have proven that 
for the optimal estimation of a channel matrix, we have to take input qubit - measurement 
pairs in the channel directions (Theorem [7j), and the same statement is true for estimating 
the channel contraction parameters (Theorem [TU]) . This result can be implemented using 
a two-step algorithm for optimally estimating the channel parameters, where in the first, 
shorter step we make a rough estimation on the channel directions, and then we set 
these directions for the second step of the algorithm when the contraction parameters are 
obtained. 

The estimation of the angle parameters can not be performed analytically in the 
general case, and the loss function has a quite complex form even in a simplified case. 
From simulation investigations we conjecture that using a complementary basis to the 
channel directions would give a nearly optimal result. 
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